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(57) Techniques for determining electronic docu- 
ment information for a paper document. The electronic 
document information determined for a paper document 
may include information identifying an electronic docu- 
ment corresponding to the paper document. The elec- 
tronic document information may also include informa- 
tion identifying a location where the electronic document 
is stored or a pointer or reference to the electronic doc- 
ument. The electronic document information deter- 
mined for a paper document may be stored along with 
identification code information read from an identifica- 
tion tag that is physically associated with the paper doc- 
ument. The electronic document information for a paper 
document may also be stored in an identification tag that 
is physically associated with the paper document or 
physically associated with another paper document 
generated based upon the paper document. 
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Description 

BACKGROUND OF THE INVENTION 

[0001] The present invention relates generally to the 
field of document processing, and more particularly to 
techniques for locating electronic documents corre- 
sponding to paper documents. 
[0002] Due to the widespread use of computers and 
other data processing systems an increasing amount of 
information is now being stored in digital or electronic 
form. For example, electronic information is stored in the 
form of electronic documents such as files, etc. Exam- 
ples of electronic documents include documents creat- 
ed using application programs such as word proces- 
sors, email programs, image editors, etc., scanned im- 
ages of paper documents, documents stored in various 
formats such as PDF, Postscript, etc., and other types 
of documents stored in electronic form. 
[0003] The vast number of electronic documents cou- 
pled with the proliferation of paper document output de- 
vices such as copiers, printers, and facsimile machines, 
has also greatly increased the number of paper docu- 
ments that are generated using the electronic docu- 
ments. These paper documents include printouts gen- 
erated by printers, photocopies generated by copiers, 
facsimile documents generated by facsimile machines, 
photos, and the like. 

[0004] Due to the large number of electronic docu- 
ments and paper documents, it becomes very difficult 
to identify an electronic document corresponding to a 
particular paper document. The problem is particularly 
acute in office environments. Most office users have had 
the experience of spending a great amount of time trying 
to locate an electronic document corresponding to a par- 
ticular paper document. 

[0005] As a result, the ability to locate electronic doc- 
uments corresponding to paper documents is of great 
importance, especially in an office environment. 

BRIEF SUMMARY OF THE INVENTION 

[0006] Embodiments of the present invention provide 
techniques for determining electronic document infor- 
mation for a paper document. The electronic document 
information determined for a paper document may in- 
clude information identifying an electronic document 
corresponding to the paper document. The electronic 
document information may also include information 
identifying a location where the electronic document is 
stored or a pointer or reference to the electronic docu- 
ment. The electronic document information determined 
for a paper document may be stored along with identi- 
fication code information read from an identification tag 
that is physically associated with the paper document. 
The electronic document information for a paper docu- 
ment may also be stored in an identification tag that is 
physically associated with the paper document or phys- 



ically associated with another paper document generat- 
ed based upon the paper document. 
[0007] According to an embodiment of the present in- 
vention, techniques are provided for processing a paper 

5 document. In this embodiment, a first file descriptor is 
determined from an electronic representation of a first 
paper document. A feature descriptor is determined 
from a first database that matches the first feature de- 
scriptor, the first database storing information identifying 

10 a plurality of feature descriptors and electronic docu- 
ment information for each feature descriptor in the plu- 
rality of feature descriptors, wherein the electronic doc- 
ument information for a feature descriptor comprises in- 
formation identifying an electronic document. Electronic 

is document information associated with the matching fea- 
ture descriptor is determined from the information stored 
in the first database. The identification code associated 
with an identification tag along with the electronic doc- 
ument information associated with the matching feature 

20 descriptor is stored in a second database such that the 
electronic document information associated with the 
matching feature descriptor is associated with the iden- 
tification code. 

[0008] According to another embodiment of the 

25 present invention, techniques are provided for perform- 
ing an operation using a paper document. In this em- 
bodiment, a first identification code is read from an iden- 
tification tag that is physically associated with a first pa- 
per document. An identification code from a first data- 

30 base that matches the first identification code is then 
determined, the first database storing information iden- 
tifying a plurality of identification codes and electronic 
document information for each identification code in the 
plurality of identification codes, the electronic document 

35 information for each identification code comprising in- 
formation identifying an electronic document. Electronic 
document information associated with the matching 
identification code is then determined from the first da- 
tabase. The operation is then performed or initiated us- 

40 ing the electronic document identified by the electronic 
document information associated with the matching 
identification code. 

[0009] The foregoing, together with other features, 
embodiments, and advantages of the present invention, 
45 will become more apparent when referring to the follow- 
ing specification, claims, and accompanying drawings. 

BRIEF DESCRIPTION OF THE DRAWINGS 

so [0010] 

Fig. 1A is a simplified block diagram of a system 
that may incorporate an embodiment of the present 
invention; 

55 Fig. 1 B is a simplified block diagram of another sys- 
tem that may incorporate an embodiment of the 
present invention; 

Fig. 2 is a simplified high-level flowchart depicting 
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a method performed by a paper document process- 
ing device for generating a paper document accord- 
ing to an embodiment of the present invention; 
Fig. 3 is a simplified high-level flowchart depicting 
a method performed by a paper document process- 
ing device for generating a paper document accord- 
ing to another embodiment of the present invention; 
Fig. 4 is a simplified high-level flowchart depicting 
a method performed by a paper document process- 
ing device for physically associating an identifica- 
tion tag with original document 1 1 2 to produce doc- 
ument 1 1 2' according to another embodiment of the 
present invention; 

Fig. 5 is a simplified high-level flowchart depicting 
a method performed by a data processing system 
for determining electronic document information ac- 
cording to an embodiment of the present invention; 
Fig. 6 is a simplified high-level flowchart depicting 
a method of extracting a feature descriptor from an 
electronic representation of a paper document ac- 
cording to an embodiment of the present invention; 
and 

Fig. 7 depicts an example of applying the method 
depicted in Fig. 6 according to an embodiment of 
the present invention. 

DETAILED DESCRIPTION OF THE INVENTION 

[001 1 ] Embodiments of the present invention provide 
techniques for determining electronic document infor- 
mation for a paper document. The electronic document 
information determined for a paper document may in- 
clude information identifying an electronic document 
corresponding to the paper document. The electronic 
document information may also include information 
identifying a location where the electronic document is 
stored or a pointer or reference to the electronic docu- 
ment. According to an embodiment of the present inven- 
tion, the electronic document information determined for 
a paper document is stored along with identification 
code information read from an identification tag that is 
physically associated with the paper document. Accord- 
ing to another embodiment of the present invention, the 
electronic document information for a paper document 
is stored in an identification tag that is physically asso- 
ciated with the paper document or physically associated 
with another paper document generated based upon the 
paper document. 

[001 2] For purposes of this invention, the term "paper 
document" is intended to refer to any document com- 
prising one or more pages of any tangible medium on 
which information can be printed, written, drawn, im- 
printed, embossed, etc. For example, a paper document 
may referto a document printed by a printer, a document 
output by a copier or scanner, a document output by a 
facsimile machine, a photograph, a book, a single page 
of paper, a document comprising a plurality of papers, 
and the like. 



[0013] For purposes of this invention, the term "elec- 
tronic document" is intended to refer to information 
stored in an electronic or digital form. Examples of elec- 
tronic documents include files, documents created us- 

5 ing application programs such as word processors, 
email programs, image editors, etc., scanned images of 
paper documents, documents stored in various formats 
such as PDF, Postscript, etc., and other types of docu- 
ments stored in electronic form. 

io [0014] Fig. 1 A is a simplified block diagram of a sys- 
tem 1 00 that may incorporate an embodiment of the 
present invention. As depicted in Fig. 1A, system 100 
comprises a paper document processing device 102 
coupled to a data processing system 1 04 via communi- 

'5 cation links 1 06. Data processing system 1 04 is coupled 
to a paper document tracking database 1 08 and to one 
or more document information databases 110. It should 
be understood that the configuration depicted in Fig. 1A 
is merely illustrative of an embodiment incorporating the 

20 present invention and does not limit the scope of the in- 
vention as recited in the claims. One of ordinary skill in 
the art would recognize other variations, modifications, 
and alternatives. 

[0015] Communication links 106 depicted in Fig. 1A 

25 may be of various types including hardwire links, optical 
links, satellite or other wireless communication links, 
wave propagation links, or any other mechanisms for 
communication of information. Various communication 
protocols may be used to facilitate communication of in- 

30 formation via the communication links. These commu- 
nication protocols may include TCP/IP, HTTP protocols, 
extensible markup language (XML), wireless applica- 
tion protocol (WAP), protocols underdevelopment by in- 
dustry standard organizations, vendor-specific proto- 

35 cols, customized protocols, and others. 

[0016] Communication links 1 06 may traverse one or 
more communication networks. These communication 
networks may include a LAN, a wide area network 
(WAN), a metropolitan area network (MAN), a wireless 

40 network, an Intranet, the Internet, a private network, a 
public network, a switched network, an optical network, 
or any other suitable communication network. 
[001 7] Paper document processing device 1 02 is con- 
figured to physically associate identification tags with 

45 paper documents. Paper document processing device 
102, in cooperation with data processing system 104, 
also facilitates identification of electronic document in- 
formation for a paper document. The electronic docu- 
ment information may include information identifying an 

so electronic document corresponding to the paper docu- 
ment and information identifying a location where the 
electronic document is stored or from where the elec- 
tronic document can be accessed or a pointer or refer- 
ence to the electronic document. In the embodiment de- 

55 picted in Fig. 1A, paper document processing device 
1 02 provides information to data processing system 1 04 
that is used by data processing system 1 04 to determine 
electronic document information for a paper document 
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processed by paper document processing device 102. 
Paper document processing device 102 receives the 
electronic document information from data processing 
system 104 and may store the electronic document in- 
formation or a portion thereof on the identification tag 
that is physically associated with the paper document. 
Paper document processing device 1 02 may be embod- 
ied as a printer, scanner, copier, facsimile machine, 
camera that are capable of printing photos, and the like. 
[0018] According to an embodiment of the present in- 
vention, paper document processing device 1 02 is con- 
figured to receive a paper document 1 1 2 and physically 
associate an identification tag with document 11 2 to pro- 
duce document 1 1 2'. For purposes of clarity, paper doc- 
ument 1 1 2 will be referred to as the "original paper doc- 
ument". Accordingly, in this embodiment, paper docu- 
ment 112' represents the original paper document with 
an identification tag physically associated with it. Origi- 
nal paper document 1 1 2 may be a single-page or a mul- 
ti-page paper document. 

[0019] In other embodiments, paper document 
processing device 102 may be configured to generate 
a new paper document 114 based upon original paper 
document 112 and physically associate an identification 
tag with new document 114. For example, if paper doc- 
ument processing device 102 is configured to function 
as a copier, paper document processing device 1 02 may 
generate a photocopy 1 1 4 for paper document 1 1 2 and 
physically associate an identification tag with photocopy 
114. 

[0020] In yet other embodiments, paper document 
processing device 102 may be configured to receive 
electronic information 116 and print the electronic infor- 
mation on a paper medium to generate paper document 
118. In this embodiment, paper document processing 
device 1 02 is configured to physically associate an iden- 
tification tag with paper document 1 1 8. For example, pa- 
per document processing device 1 02 may be configured 
to function as a printer or a facsimile machine. In a print- 
er mode, paper document processing device 102 re- 
ceives electronic information 1 1 6 and generates a print- 
out 118 based upon the information. In a facsimile ma- 
chine mode, paper document processing device 102 re- 
ceives facsimile information 1 1 6 and generates a fac- 
simile paper document 118. An identification tag is at- 
tached to the printout or facsimile paper document. 
[0021] As shown in Fig. 1 A, paper document process- 
ing device 1 02 includes a processor 1 20 that communi- 
cates with a number of peripheral subsystems via a bus 
subsystem 122. These peripheral subsystems may in- 
clude a storage subsystem 124, a user interface sub- 
system 1 26, a network interface subsystem 1 28, a scan- 
ner subsystem 130, a print subsystem 132, and a tag 
subsystem 134. 

[0022] Bus subsystem 1 22 provides a mechanism for 
letting the various components and subsystems of pa- 
per document processing device 1 02 communicate with 
each other as intended. Although bus subsystem 122 is 



shown schematically as a single bus, alternative em- 
bodiments of the bus subsystem may utilize multiple 
buses. 

[0023] User interface subsystem 1 26 allows user in- 

5 teraction with paper document processing device 102. 
A user may be a human user, a device, a process, an- 
other computer, an application program, and the like. 
User interface subsystem 1 26 may be implemented us- 
ing a touch pad, a touch screen, mechanical buttons, 

10 and the like. For example, when paper document 
processing device 102 is configured to function as a 
copier, interface 126 allows the user to select options 
for generating a photocopy 114 from original document 
112. For example, interface 126 may allow the user to 

15 select the paper size to be used for generating the pho- 
tocopy, the copy darkness, the copy contrast, number 
of copies to be produced, magnification/reduction ratio, 
and other like options known to those skilled in the art. 
Interface 126 may also provide a "Copy/Start" key that 

20 allows the user to initiate the copying function . Upon se- 
lection of the "Start/Copy" key, paper document 
processing device 102 prints a copy 1 1 4 (or photocopy) 
of original document 112. 

[0024] According to an embodiment of the present in- 

25 vention, interface 1 1 6 allows a user to select if an iden- 
tification tag is to be physically associated with docu- 
ments 112', 114, or 118 generated by paper document 
processing device 102. For example, before photocop- 
ying an original document 112, interface 126 may 

30 prompt the user to select whether an identification tag 
is to be physically associated with the photocopy. An 
identification tag is then physically associated with the 
photocopy paper document only if the user indicates 
that an identification tag is to be physically associated 

35 with the photocopy. 

[0025] In an alternative embodiment of the present in- 
vention, an identification tag may be physically associ- 
ated with documents 112', 114, or 118 without prompting 
the user or without receiving the user's permission. For 

40 example, when functioning in a copier mode, activation 
of the "Copy/Start" key by the user is sufficient to initiate 
the photocopying function, cause paper document 
processing device 1 02 to physically associate an iden- 
tification tag to the generated photocopy paper docu- 

45 ment, perform operations to determine electronic docu- 
ment information for the paper document, and store the 
electronic document information or a portion thereof on 
the identification tag that is physically associated with 
the generated paper document. 

so [0026] According to an embodiment of the present in- 
vention, selection of the an operation key such as 
"Copy/Start" key represents a single user command that 
is interpreted by paper document processing device 1 02 
to request both performance of the operation (e.g., cop- 

55 ying), physically associating an identification tag with 
the generated photocopy, performing operations to de- 
termine electronic document information for the paper 
document, and storing the electronic document informa- 
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tion or a portion thereof on the identification tag that is 
physically associated with the generated paper docu- 
ment. In this embodiment, the user may not be aware 
that an identification tag has been applied to the paper 
document (112', 114 or 118) produced by paper docu- 
ment processing device 102 and that electronic docu- 
ment information corresponding to the paper document 
has been stored on the identification tag. 
[0027] User interface subsystem 1 1 6 also serves as 
an interface for outputting information to the user. For 
example, information related to the status of paper doc- 
ument processing device 1 02 or information related to 
operations performed by paper document processing 
device 1 02 might be output to the user via interface 1 26. 
[0028] Network interface 1 28 provides an interface to 
other computer systems, networks, and devices. As de- 
picted in Fig. 1A, network interface subsystem 128 en- 
ables paper document processing device 102 to com- 
municate and exchange information with data process- 
ing system 1 04. Embodiments of network interface sub- 
system 128 include an Ethernet card, a modem (tele- 
phone, satellite, cable, ISDN, etc.), (asynchronous) dig- 
ital subscriber line (DSL) units, and the like. Network in- 
terface system 128 facilitates transfer of information to 
and from paper document processing device 102 in an 
appropriate format. 

[0029] Processor 1 20 performs or controls the overall 
functioning of paper document processing device 102. 
For example, processor 120 may execute a program 
that controls the functions of the various subsystems of 
paper document processing device 102. These func- 
tions may include physically associating identification 
tags with documents 112', 114, or 118, facilitating iden- 
tification of electronic document information for paper 
documents, and storing the electronic document infor- 
mation or a portion thereof or other information on the 
identification tags that are physically associated with pa- 
per documents 112', 114, or 118. 
[0030] Storage subsystem 124 stores the basic pro- 
gramming and data constructs that provide the function- 
ality of paper document processing device 1 02. For ex- 
ample, according to an embodiment of the present in- 
vention, software modules and control programs imple- 
menting the functionality of paper document processing 
device 1 02 may be stored in storage subsystem 1 24. 
Processor 120 may execute these software modules. 
Storage subsystem 124 may also provide a repository 
or database for storing information according to the 
teachings of the present invention. 
[0031] Scanner subsystem 130 includes a scanner 
136 and an image-processing unit 138. Scanner 136 is 
configured to capture an electronic representation (e.g., 
a scanned image) of original paper document 112. Ac- 
cording to an embodiment of the present invention, the 
scanned image is transmitted to image processing unit 
138 that operates to remove distortion inherent in the 
scanning process. The output of image-processing unit 
1 38 is a scanned image of original paper document 1 1 2 



with the distortion removed. The scanned image may 
then be forwarded to print subsystem 1 32 for further 
processing via video bus 140. The scanned image may 
be used to generate document 114. 

5 [0032] According to an embodiment of the present in- 
vention, an image data tap 142 provides processor 120 
with access to scanned image data transferred over vid- 
eo bus 140. In alternative embodiments of the present 
invention, the image scanned by scanner 136 may be 

10 provided to processor 1 20 by scanner subsystem 1 30. 
[0033] According to an embodiment of the present in- 
vention, the electronic representation of original paper 
document 112 (e.g., the scanned image of original doc- 
ument 1 1 2) captured by paper document processing de- 

15 vice 102 is communicated to data processing system 
104 for further processing. Data processing system 104 
uses the electronic representation of original paper doc- 
ument 112 received from paper document processing 
device 102 to determine electronic document informa- 

20 tion for the paper document. The electronic document 
information or a portion thereof is then communicated 
to paper document processing device 1 02 from data 
processing system 104. 

[0034] Print subsystem 132 is used in embodiments 
25 where paper document processing device 102 is con- 
figured to generate a new paper document, for example, 
documents 114 or 118. For example, print subsystem 
132 is used in embodiments where paper document 
processing device 1 02 is configured to operate as a cop- 
30 ier, printer, facsimile machine, etc. Print subsystem 132 
operates in accordance with well-known laser xerogra- 
phy principles. In a copier mode, print subsystem 1 32 is 
configured to receive scanned information from scanner 
subsystem 130 and produce a paper document photo- 
ns copy 1 14 of original document 112. In a printer or fac- 
simile machine mode, print subsystem 1 32 is configured 
to generate a paper document printout or facsimile doc- 
ument based on electronic information 1 1 6 or based up- 
on scanned information from scanner subsystem 130. 
40 Print subsystem 132 operates in synchrony with tag 
subsystem 1 34 to ensure that an identification tag is ap- 
propriately physically associated with documents 112', 
114, or 118. 

[0035] Tag subsystem 134 is configured to physically 
45 associate identification tags with documents 112', 114, 
or 1 1 8 generated by paper document processing device 
102. Various devices known to those skilled in the art 
may be used as identification tags. According to an em- 
bodiment of the present invention, each identification 
so tag device includes an on-device memory that is capa- 
ble of storing information. 

[0036] According to an embodiment of the present in- 
vention, each identification tag stores information that 
can be used to uniquely identify the paper document 
55 with which the identification tag is physically associated. 
For example, an identification tag may store an identifi- 
cation code that may be used to uniquely identify a pa- 
per document with which the identification tag is physi- 
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cally associated. It should be understood that other 
types of information that can be used to identify docu- 
ments might also be stored in an identification tag in al- 
ternative embodiments of the present invention. 
[0037] Depending upon the type ot identification tag 
used, the identification code information may be stored 
in the memory of the identification tag when the tag is 
manufactured, when the tag is activated, or at some lat- 
er time using software or hardware techniques. For ex- 
ample, paper document processing device 102 may be 
configured to write a unique identification code in the 
memory of an identification tag that is selected to be 
physically associated with a paper document generated 
by paper document processing device 102. 
[0038] According to an embodiment of the present in- 
vention, radio frequency identification devices (RFIDs) 
are used as identification tags. For example, the Mu- 
chip manufactured by Hitachi, Ltd. may be used as an 
identification tag. The Mu-chip is a radio recognition IC 
chip that measures 0.4-mm square and is 60 microns 
thick in its present embodiment. Due to its reduced size, 
the Mu-chip can be embedded in paper. The Mu-chip 
comprises an on-chip memory (presently a 128-bit 
ROM) that can be programmed to store information (e. 
g., an identification code, electronic document informa- 
tion). 

[0039] According to an embodiment of the present in- 
vention, tag subsystem 134 is configured to read infor- 
mation (e.g., identification code) from an identification 
tag that is physically associated with a paper document. 
In order to facilitate reading of the identification code 
from the identification tag, one or more tag readers 144 
may be incorporated into tag subsystem 134. If an iden- 
tification tag is physically associated with original docu- 
ment 112, the tag readers of tag subsystem 134 may 
read the information stored from the identification tag 
physically associated with the original document. The 
tag readers are also configured to read information from 
identification tags that are either physically associated 
with or selected to be physically associated with docu- 
ments 112*, 1 14, or 1 1 8. The identification code informa- 
tion read by paper document processing device 102 
from an identification tag is forwarded to data process- 
ing system 104 for further processing. 
[0040] The type of tag reader used and the technique 
used by the tag reader for reading the information stored 
on an identification tag depends on the type of identifi- 
cation tags. According to an embodiment of the present 
invention, a tag reader may read information from an 
identification tag by scanning the identification tag. In 
other embodiments, a tag reader may read the informa- 
tion by receiving the information from the identification 
tag. A tag reader may receive the information from an 
identification tag in response to an external stimulus (e. 
g., electromagnetic energy) or signal provided to the 
identification tag. The external stimulus or signal may 
be provided by the tag reader or by some other device 
or application. If the identification tag is self-powered, 



the identification tag may be configured to communicate 
the information stored by the identification tag to the tag 
reader. A tag reader may read the information stored by 
a tag reader in various forms including electromagnetic 
5 waves, electronic signals, acoustic signals, etc. 

[0041] For example, in its present embodiment, infor- 
mation stored by a Hitachi Mu-chip can be read by ex- 
posing the Mu-chip to 2.45 GHz band microwaves. The 
Mu-chip uses the microwave energy as a power source 
10 and communicates the information stored in the on-chip 
memory in the form of radio waves that can be detected 
and read by a tag reader. In alternative embodiments, 
the Mu-chip may also include an on-chip antenna to in- 
crease the communication distance range. In yet other 
is embodiments, a power source may be provided on the 
Mu-chip and the Mu-chip may be programmed to com- 
municate the information stored by the Mu-chip to a tag 
reader. Accordingly, if a Hitachi Mu-chip is used as an 
identification tag, tag reader 1 44 included in tag subsys- 
tem 134 may be constructed to expose the Mu-chip to 
2.45 GHz band of microwaves and to detect and read 
radio waves emanating from the Mu-chip. Various other 
types of tag readers that are capable of detecting elec- 
tromagnetic, optical, acoustical, or other signals may be 
used in alternative embodiments of the present inven- 
tion using other forms of identification tags. 
[0042] Paper document processing device 102 may 
use various different techniques to physically associate 
an identification tag with a paper document (document 
112', 114, or 118) generated by paper document 
processing device 102. The identification tag may be 
permanently or temporarily attached to a paper docu- 
ment or a page of the paper document. The tag may be 
attached to the paper document using clips, staples, ad- 
hesives, ties, strings, rubber bands, labels with identifi- 
cation tags, and the like. The identification tag may be 
embedded in a page of the paper document. An identi- 
fication tag may also be incorporated into an object such 
as a label, a staple, etc. that is attached to the paper 
document. 

[0043] According to one technique, depending on the 
size of the identification tag, the identification tag is em- 
bedded in a page of the paper document. For example, 
a Mu-chip, due to its reduced size, can be embedded in 
a page of the paper document. A paper page that has 
an identification tag embedded in it may be referred to 
as "tagged paper". Accordingly, in an embodiment using 
Hitachi Mu-chips as identification tags, paper document 
processing device 102 may physically associate an 
identification tag with a paper document by using at least 
one tagged paper to generate the paper document. Ac- 
cording to an embodiment of the present invention, if the 
paper document processing device 102 has access to 
tagged paper, paper document processing device 1 02 
may use one or more of the tagged papers to generate 
the paper document. This technique may be used when 
paper document processing device 102 generates a 
new paper document (either paper document 114 or 
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1 1 8). In this embodiment, tag subsystem 1 34 in conjunc- 
tion with print subsystem 132 ensures that at least one 
tagged page is used to generate paper document 114 
or 118. If the generated new paper document consists 
of multiple pages, then normal paper pages (i.e., paper 
pages without the embedded identification tag) may be 
used for other pages of the document. The selection of 
one or more tagged papers for generating paper docu- 
ments 114 or 118 may be performed by tag subsystem 
124, by print subsystem 122, or by the two in conjunc- 
tion. 

[0044] An identification tag may also be physically as- 
sociated with a paper document by physically applying 
or attaching the identification tag to the paper document. 
If the size of the identification tag permits, the identifica- 
tion tag may be attached to the paper document such 
that it is not directly visible to the unaided observer or 
user of the paper document. For example, the identifi- 
cation tag may be embedded or mounted on a staple, 
and paper document processing device 1 02 may attach 
the staple to the paperdocument. Such a technique may 
be used to generate document 112* by applying an iden- 
tification tag to original document 112. Such a technique 
may also be used to generate documents 114 or 118. 
[0045] According to an embodiment of the present in- 
vention, an identification tag may be attached to a paper 
document by mixing the identification tag in the ink that 
is printed on one or more pages of the paper document. 
Such a technique may be used when paper document 
processing device 1 02 generates a new document (i.e., 
document 114 or 118). An identification tag may also be 
physically associated with a paper document generated 
by paper document processing device 1 02 using an ad- 
hesive, a clip, or other like mechanism. 
[0046] In alternative embodiments, paper document 
processing device 102 may dispense a sticker or label 
comprising an identification tag. The identification tag 
may be physically associated with a paper document 
(either document 112', 1 1 4, or 1 1 8) generated by paper 
document processing device 1 02 by applying the sticker 
or label to the paper document. For example, paper doc- 
ument processing device 102 may dispense a label 
comprising an identification tag and a user may be re- 
quested to apply the dispensed label to a paper docu- 
ment. For example, the label may be applied to original 
document 1 1 2 to generate document 1 1 2'. Various other 
techniques may also be used to physically associate an 
identification tag with a paper document. 
[0047] As indicated above, according to an embodi- 
ment of the present invention, each identification tag 
stores identification code information that is used to 
uniquely identify the paper document with which the 
identification tag is physically associated. According to 
an embodiment of the present invention, paper docu- 
ment processing device 1 02 may be configured to store 
the identification code information in an identification 
tag. Paper document processing device 102 may also 
be configured to store electronic document information 



or a portion thereof in an identification tag that is phys- 
ically associated with a paper document. In the embod- 
iment depicted in Fig. 1 A, paper document processing 
device 102 receives the electronic document informa- 

5 tion from data processing system 1 04. The electronic 
document information may include information identify- 
ing an electronic document for the paper document, in- 
formation identifying a storage location of the electronic 
document or a pointer or reference to the electronic doc- 

10 ument. 

[0048] In addition to identification code information 
and electronic document information, other type of in- 
formation related to a paper document may also be 
stored in an identification tag that is physically associ- 

15 ated with the paper document. For example, the infor- 
mation stored in an identification tag may include an 
electronic representation ofthe contents of the paper 
document (e.g., a scanned image of the paper docu- 
ment, a Postscript format file or Portable Document For- 

20 mat (PDF) or image format or other file format, etc.), in- 
formation identifying the author of the paper document, 
the date when the paper document was created, the 
number of pages in the paper document, and other like 
information. 

25 [0049] The information stored in an identification tag 
may be used to track the physical location of the paper 
document with which the identification tag is physically 
associated. 

[0050] The various subsystems of paper document 
30 processing device 1 02 function in synchrony to ensure 
proper functioning of paper document processing de- 
vice 102. Various synchronization signals known to 
those skilled in the art may be used to achieve this syn- 
chronization. It should be understood that various other 
35 configurations of paper document processing device 
1 02 are possible having more orfewer components than 
the system depicted in Fig. 1A. 

[0051] Data processing system 104 depicted in Fig. 
1A is configured to determine electronic document in- 

40 formation for a paper document based upon information 
related to the paper document received from paper doc- 
ument processing device 1 02 and based upon informa- 
tion stored in one or more document information data- 
bases 110.. Data processing system 104 may be any 

45 computer system known to those skilled in the art. 
[0052] As indicated above, for a paper document 
processed by paper document processing device 102, 
data processing system 104 receives, from paper doc- 
ument processing device 102, identification code infor- 

50 mation used to identify the paper document and an elec- 
tronic representation (e.g., a scanned image) of the pa- 
per document captured by paper document processing 
device 102. According to an embodiment ofthe present 
invention, data processing system 104 extracts a fea- 

55 ture descriptor from the scanned image received from 
paper document processing device 102. Based upon 
the feature descriptor determined from the electronic 
representation of the paper document and based upon 
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information stored in document information databases 
110, data processing system 1 04 determines electronic 
document information for the paper document. Further 
details related to processing performed by data 
processing system 1 04 for determining electronic doc- 
ument information for a paper document arc described 
below. 

[0053] A feature descriptor extracted from an elec- 
tronic representation of a paper document represents 
information extracted from electronic representation (e. 
g., a scanned image). The feature descriptor can be 
used to uniquely identify the document. The extracted 
feature descriptor can be matched to other feature de- 
scriptors if the feature descriptors are consistently ex- 
tracted from a similarly located section (e.g., a small rec- 
tangular patch such as a 1 inch by 1 inch patch) of the 
scanned image. If feature descriptors are extracted from 
consistent portions of the electronic representations (e. 
g., a scanned image) of paper documents, then if two 
feature descriptors match then there is a high likelihood 
that the scanned images from which the feature descrip- 
tors were extracted also match. Alternatively, if two fea- 
ture descriptors do not match, then there is a high like- 
lihood that the scanned images used for extracting the 
feature descriptors do not match, which in turn implies 
that the documents corresponding to the scanned im- 
ages are different. One technique used by data process- 
ing system 104 for extracting a feature descriptor from 
an electronic representation of a paper document is de- 
scribed below. Various other feature extraction tech- 
niques known to those skilled in the art may also be used 
by data processing system 104. For example, various 
feature extraction techniques described in U.S. Patent 
No. 6,104,834 may also be used by the present inven- 
tion. The entire contents of U.S. Patent No. 6,104,834 
are incorporated herein by reference for all purposes. 
[0054] According to an embodiment of the present in- 
vention, the electronic document information deter- 
mined by data processing system 1 04 is communicated 
to paper document processing device 1 02. Paper doc- 
ument processing device 102 may then store the elec- 
tronic document information or a portion thereof in the 
identification tag that is physically associated with the 
paper document. 

[0055] According to an embodiment of the present in- 
vention, each document information database 110 
stores feature descriptors that have been previously de- 
termined for a plurality of documents. For each feature 
descriptor, electronic document information corre- 
sponding to the feature descriptor is also stored. As de- 
picted in Fig. 1A, the information may be stored in the 
form of a table wherein each row of the table stores in- 
formation related to a particular feature descriptor. As 
shown in Fig. 1 A, each row stores information identifying 
a feature descriptor and electronic document informa- 
tion corresponding to the feature descriptor. 
[0056] The electronic document information may 
comprise information identifying an electronic document 



and a storage location for the electronic document rep- 
resented by the feature descriptor (or a pointer of refer- 
ence to the electronic document). For example, elec- 
tronic document information for a particular feature de- 
5 scriptor may identify a uniform resource identifier or lo- 
cator (URI or URL) pointing to an electronic document 
corresponding to the particular feature descriptor, a file 
corresponding to the particular feature descriptor and a 
directory location where the file is stored, and other like 
information. According to an embodiment of the present 
invention, the information in a document information da- 
tabase 110 is indexed by the feature descriptors infor- 
mation. Accordingly, for a particular feature descriptor 
extracted by data processing system 1 04 for a particular 
paper document processed by paper document 
processing device 102, data processing system 104 
may use the feature descriptor to search document in- 
formation databases 110 to find one or more matching 
feature descriptors. 

[0057] Data processing system 1 04 is also configured 
to store information received from paper document 
processing device 102 and the electronic document in- 
formation determined by data processing system 104. 
According to an embodiment of the present invention, 
for each identification code for a paper document re- 
ceived from paper document processing device 1 02, da- 
ta processing system 1 04 stores the identification code 
information and the corresponding electronic document 
information determined from the electronic representa- 
tion corresponding to the paper document identified by 
the identification code. The electronic document infor- 
mation determined for a document is associated with the 
identification code corresponding to the document and 
stored in a manner such that the electronic document 
information for a particular paper document can be de- 
termined given the identification code for the paper doc- 
ument. 

[0058] Other information received by data processing 
system 1 04 from paper document processing device 
102 for the particular identification code may also be 
stored such that the other information can be retrieved 
given the identification code information. For example, 
information identifying physical locations of paper 
documents may also be stored. Other information relat- 
ed to the paper document that is received by data 
processing system 104 from paper document process- 
ing device 102 such as an electronic representation of 
the paper document may also be stored and associated 
with a particular identification code. 
[0059] According to an embodiment of the present in- 
vention, the identification code information, the electron- 
ic document information, and other information is stored 
in paper document tracking database 1 08. Paper docu- 
ment tracking database 108 may be indexed by the 
identification code information. As depicted in Fig. 1A, 
paper document tracking database 108 may be imple- 
mented as a table wherein each row of the table stores 
information related to a particular identification code. As 
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shown in Fig. 1 A, each row stores information identifying 
an identification code, electronic document information 
determined by data processing system 1 04 for a paper 
document corresponding to the identification code, 
physical location information for the paper document 
identified by the identification code, and other informa- 
tion related to the paper document identified by the iden- 
tification code. 

[0060] Fig. 1 B is a simplified block diagram of another 
system 150 that may incorporate an embodiment of the 
present invention. The embodiment depicted in Fig. 1B 
is similar to the embodiment depicted in Fig. 1 A with da- 
ta processing system 1 04 removed. In the embodiment 
depicted in Fig. 1 B, paper document processing device 
1 02 is configured to perform the functions that were per- 
formed by data processing system 104 depicted in Fig. 
1A. It should be understood that the configuration de- 
picted in Fig. 1 B is merely illustrative of an embodiment 
incorporating the present invention and does not limit 
the scope of the invention as recited in the claims. One 
of ordinary skill in the art would recognize other varia- 
tions, modifications, and alternatives. 
[0061] Fig. 2 is a simplified high-level flowchart 200 
depicting a method performed by paper document 
processing device 1 02 for generating a paper document 
(either document 112', 114, or 118) according to an em- 
bodiment of the present invention. The flowchart depict- 
ed in Fig. 2 assumes that a paper document processing 
device 1 02 is coupled to a data processing system 1 04 
(similar to the embodiment shown in Fig. 1A). However, 
it should be understood that the general processing de- 
picted in Fig. 2 may also be applied to other embodi- 
ments of the present invention. Further, flowchart 200 
depicted in Fig. 2 assumes that a new paper document 
114 is to be generated based upon original paper doc- 
ument 112. It should however be understood that the 
processing depicted in Fig. 2 may also be applied to 
generate documents 112' or 118. Accordingly, flowchart 
200 depicted in Fig. 2 is merely illustrative of an embod- 
iment incorporating the present invention and does not 
limit the scope of the invention as recited in the claims. 
One of ordinary skill in the art would recognize varia- 
tions, modifications, and alternatives. The processing 
depicted in Fig. 2 may be performed by software mod- 
ules executed by a processor of paper document 
processing device 102, by hardware modules of paper 
document processing device 102, or a combination 
thereof. 

[0062] As depicted in Fig. 2, processing is initiated 
when paper document processing device 102 receives 
a signal to generate a new paper document based upon 
an original document 112 provided to paper document 
processing device 102 (step 202). The signal may be 
received from various sources. For example, if paper 
document processing device 102 is configured to func- 
tion as a copier, the signal may be received when a user 
selects a "Copy/Start" key provided by the user interface 
of paper document processing device 102. The signal 



may also be received from various applications and de- 
vices. 

[0063] Paper document processing device 102 then 
obtains an electronic representation of the original pa- 
5 per document (step 204). According to an embodiment 
of the present invention, paper document processing 
device 1 02 captures a scanned image of the original pa- 
per document in step 204. As described above, scanner 
subsystem 130 of paper document processing device 
102 may be configured to capture and generate a 
scanned image of the original paper document. 
[0064] Paper document processing device 102 then 
generates a new paper document based upon the orig- 
inal paper document and an identification tag is physi- 
cally associated with the generated new paper docu- 
ment (step 206). As described above, various tech- 
niques may be used to physically associate an identifi- 
cation tag with the new generated paper document. Ac- 
cording to an embodiment of the present invention, the 
identification tag may be automatically physically asso- 
ciated with the paper document, for example, by using 
tagged paper to generate the new paper document, ap- 
plying an object such as a staple with the identification 
tag embedded in it to the new generated paper docu- 
ment, using ink containing the identification tag to print 
the paper document, etc. In alternative embodiments, 
paper document processing device 102 may dispense 
an identification tag (e.g., a label comprising an identi- 
fication tag) and request a user to physically associate 
the dispensed identification tag with the new paper doc- 
ument generated by paper document processing device 
102. 

[0065] Paper document processing device 102 then 
reads the identification code stored by the identification 
tag that is physically associated with the new paper doc- 
ument generated in step 206 (step 208). Tag readers 
1 44 of tag subsystem 1 34 may be used to read the iden- 
tification code information from the identification tag. As 
previously described, various different techniques may 
be used to read identification code information stored 
by the identification tag. The type of technique used for 
reading information stored by the identification tag gen- 
erally depends on the type of identification tag used by 
the paper document output device. The identification 
code read in step 208 is used to uniquely identify the 
paper document generated in step 206. 
[0066] The identification code read in step 208 and 
the electronic representation of the original paper doc- 
ument obtained in step 204 is then communicated to da- 
ta processing system 104 for further processing (step 
21 0). Other information related to the new paper docu- 
ment may also be communicated to data processing 
system 104. For example, information identifying the 
physical location of the new paper document may be 
communicated to data processing system 104. 
[0067] Paper document processing device 102 then 
receives electronic document information (or a portion 
thereof) from data processing system 104 (step 212). 
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According to an embodiment of the present invention, 
the electronic document information is determined by 
data processing system 104 based upon the electronic 
representation communicated to data processing sys- 
tem 104 in step 210. 

[0068] Paper document processing device 1 02 then 
stores the electronic document information (or a portion 
thereof) received in step 212 on the identification tag 
that is physically associated with the new paper docu- 
ment generated in step 206 (step 214). Various tech- 
niques known to those skilled in the art may be used for 
storing the information in the memory of the identifica- 
tion tag. Steps 212 and 214 may not be performed in 
certain embodiments of the present invention. 
[0069] Fig. 3 is a simplified high-level flowchart 300 
depicting a method performed by paper document 
processing device 1 02 for generating a paper document 
(either document 1 1 2', 1 1 4, or 1 1 8) according to another 
embodiment of the present invention. The flowchart de- 
picted in Fig. 3 assumes that a paper document 
processing device 102 is coupled to a data processing 
system 104 (similar to the embodiment shown in Fig. 
1 A). However, it should be understood that the general 
processing depicted in Fig. 3 may also be applied to oth- 
er embodiments of the present invention. Further, flow- 
chart 300 depicted in Fig. 3 assumes that a new paper 
document 114 is to be generated based upon original 
paper document 112. It should however be understood 
that the processing depicted in Fig. 3 may also be ap- 
plied to generate documents 112' or 118. Accordingly, 
flowchart 300 depicted in Fig. 3 is merely illustrative of 
an embodiment incorporating the present invention and 
does not limit the scope of the invention as recited in the 
claims. One of ordinary skill in the art would recognize 
variations, modifications, and alternatives. The process- 
ing depicted in Fig. 3 may be performed by software 
modules executed by a processor of paper document 
processing device 102, by hardware modules of paper 
document processing device 102, or a combination 
thereof. 

[0070] As depicted in Fig. 3, processing is initiated 
when paper document processing device 102 receives 
a signal to generate a new paper document based upon 
an original document 112 provided to paper document 
processing device 102 (step 302). The signal may be 
received from various sources. For example, if paper 
document processing device 102 is configured to func- 
tion as a copier, the signal may be received when a user 
selects a "Copy/Start" key provided by the user interface 
of paper document processing device 102. The signal 
may also be received from various applications and de- 
vices. 

[0071] Paper document processing device 102 then 
obtains an electronic representation of the original pa- 
per document (step 304). According to an embodiment 
of the present invention, paper document processing 
device 1 02 captures a scanned image of the original pa- 
per document. As described above, scanner subsystem 



130 of paper document processing device 102 may be 
configured to capture and generate a scanned image of 
the original paper document. 

[0072] An identification tag that is to be physically as- 
5 sociated with the new paper document is then selected 
(step 306). The identification tag may be selected from 
a plurality of identification tags available to paper docu- 
ment processing device 102. Alternatively, the selected 
identification tag may represent an identification tag em- 
io bedded in a tagged paper to be used for generating the 
new paper document. The selected identification tag 
may also represent an identification tag that is attached 
to an object (e.g., a staple, a clip) that is to be physically 
associated with the new paper document, 
'5 [0073] Paper document processing device 102 then 
reads the identification code stored by the identification 
tag selected in step 306 (step 308). Tag readers 144 of 
tag subsystem 1 34 may be used to read the identifica- 
tion code information from the selected identification 
tag. As previously described, various different tech- 
niques may be used to read identification code informa- 
tion stored by the identification tag. The type of tech- 
nique used for reading information stored by the select- 
ed identification tag generally depends on the type of 
identification tag used by the paper document output de- 
vice. The identification code read in step 308 is used to 
uniquely identify the new paper document to be gener- 
ated. 

[0074] The identification code read in step 308 and 
the electronic representation of the original paper doc- 
ument obtained in step 304 is then communicated to da- 
ta processing system 1 04 for further processing (step 
310). Other information related to the new paper docu- 
ment may also be communicated to data processing 
system 104. For example, information identifying the 
physical location of the new paper document may be 
communicated to data processing system 1 04. 
[0075] Paper document processing device 102 then 
receives electronic document information (or a portion 
thereof) from data processing system 104 (step 312). 
According to an embodiment of the present invention, 
the electronic document information is determined by 
data processing system 104 based upon the electronic 
representation communicated to data processing sys- 
tem 104 in step 310. 

[0076] Paper document processing device 102 then 
stores the electronic document information (or a portion 
thereof) received in step 312 on the identification tag 
selected in step 306 (step 314). Various techniques 
known to those skilled in the art may be used for storing 
the information in the memory of the identification tag. 
Steps 31 2 and 31 4 may not be performed in certain em- 
bodiments of the present invention. 
[0077] Paper document processing device 102 then 
generates a new paper document based upon the orig- 
inal paper document and the selected identification tag 
is physically associated with the generated new paper 
document (step 31 6). As described above, various tech- 



25 



30 



35 



40 



45 



50 



10 



19 



EP1 398 711 A1 



20 



niques may be used to physically associate an identifi- 
cation tag with the new generated paper document. Ac- 
cording to an embodiment of the present invention, the 
identification tag may be automatically physically asso- 
ciated with the paper document, for example, by using 5 
tagged paper to generate the new paper document, ap- 
plying an object such as a staple with the identification 
tag embedded in it to the new generated paper docu- 
ment, using ink containing the identification tag to print 
the paper document, etc. In alternative embodiments, 10 
paper document processing device 102 may dispense 
an identification tag (e.g., a label comprising an identi- 
fication tag) and request a user to physically associate 
the dispensed identification tag with the new paper doc- 
ument generated by paper document processing device '5 
102. 

[0078] Although flowcharts 200 and 300 depict meth- 
ods for generating a new paper document 1 1 4, the flow- 
charts steps may also be used to physically associate 
an identification tag with original paper document 1 1 2 to 20 
produce document 112'. Fig. 4 is a simplified high-level 
flowchart 400 depicting a method performed by paper 
document processing device 102 for physically associ- 
ating an identification tag with original document 112 to 
produce document 112' according to another embodi- 25 
ment of the present invention. Flowchart 400 depicted 
in Fig. 4 assumes that a paper document processing de- 
vice 1 02 is coupled to a data processing system 1 04 
(similar to the embodiment shown in Fig. 1 A). However, 
it should be understood that the general processing de- so 
picted in Fig. 4 may also be applied to other embodi- 
ments of the present invention. Accordingly, flowchart 
400 depicted in Fig. 4 is merely illustrative of an embod- 
iment incorporating the present invention and does not 
limit the scope of the invention as recited in the claims. 35 
One of ordinary skill in the art would recognize varia- 
tions, modifications, and alternatives. The processing 
depicted in Fig. 4 may be performed by software mod- 
ules executed by a processor of paper document 
processing device 102, by hardware modules of paper 40 
document processing device 102, or a combination 
thereof. 

[0079] As depicted in Fig. 4, processing is initiated 
when paper document processing device 1 02 receives 
a signal to physically associate an identification tag with 45 
an original document 112 (step 402). The signal may be 
received from various sources. For example, the signal 
may be received via a user interface of paper document 
processing device 1 02. The signal may also be received 
from various applications and devices. so 
[0080] Paper document processing device 1 02 then 
obtains an electronic representation of the original pa- 
per document (step 404). According to an embodiment 
of the present invention, paper document processing 
device 1 02 captures a scanned image of the original pa- 55 
per document in step 404. As described above, scanner 
subsystem 130 of paper document processing device 
102 may be configured to capture and generate a 



scanned image of the original paper document. 
[0081] An identification tag that is to be physically as- 
sociated with the original paper document is then select- 
ed (step 406). The identification tag may be selected 
from a plurality of identification tags available to paper 
document processing device 102. Alternatively, the se- 
lected identification tag may represent an identification 
tag attached to an object (e.g., a staple, a clip) that is to 
be physically associated with the original paper docu- 
ment, etc. 

[0082] Paper document processing device 1 02 then 
reads the identification code stored by the identification 
tag selected in step 406 (step 408). Tag readers 144 of 
tag subsystem 1 34 may be used to read the identifica- 
tion code information from the identification tag. As pre- 
viously described, various different techniques may be 
used to read identification code information stored by 
the identification tag. The type of technique used for 
reading information stored by the identification tag gen- 
erally depends on the type of identification tag used by 
the paper document output device. The identification 
code read in step 408 is used to identify the new paper 
document to be generated. 

[0083] The identification code read in step 408 and 
the electronic representation of the original paper doc- 
ument obtained in step 404 is then communicated to da- 
ta processing system 104 for further processing (step 
41 0). Other information related to the original paper doc- 
ument may also be communicated to data processing 
system 104. For example, information identifying the 
physical location of the original paper document may be 
communicated to data processing system 104. 
[0084] Paper document processing device 102 then 
receives electronic document information (or a portion 
thereof) from data processing system 104 (step 412). 
The electronic document information is determined by 
data processing system 104 based upon the electronic 
representation communicated to data processing sys- 
tem 104 in step 410. 

[0085] Paper document processing device 102 then 
stores the electronic document information (or a portion 
thereof) received in step 412 on the identification tag 
selected in step 406 (step 414). Various techniques 
known to those skilled in the art may be used for storing 
the information in the memory of the identification tag. 
Steps 41 2 and 41 4 may not be performed in certain em- 
bodiments of the present invention. 
[0086] Paper document processing device 1 02 then 
physically associates the identification tag selected in 
step 406 with the original paper document (step 416). 
As described above, various techniques may be used 
to physically associate an identification tag with the orig- 
inal paper document. According to an embodiment of 
the present invention, the identification tag may be au- 
tomatically physically associated with the paper docu- 
ment, for example, by applying an object such as a sta- 
ple with the identification tag embedded in it to the orig- 
inal paper document. In alternative embodiments, paper 



11 



21 



EP1 398 711 A1 



22 



document processing device 102 may dispense an 
identification tag (e.g., a label comprising an identifica- 
tion tag) and request a user to physically associate the 
dispensed identification tag with the original paper doc- 
ument. 5 
[0087] As described above, according to an embodi- 
ment of the present invention, data processing system 
104 determines electronic document information for a 
paper document based upon information received from 
paper document processing device 1 02. Fig. 5 is a sim- io 
plified high-level flowchart 500 depicting a method per- 
formed by data processing system 104 for determining 
electronic document information according to an em- 
bodiment of the present invention. The flowchart depict- 
ed in Fig. 5 assumes that a paper document processing 1$ 
device 1 02 is coupled to a data processing system 1 04 
(similar to the embodiment shown in Fig. 1 A). However, 
it should be understood that the general processing de- 
picted in Fig. 5 may also be applied to other embodi- 
ments of the present invention. For example, for the em- 20 
bodiment of the present invention depicted in Fig. 1 B, 
the processing depicted in Fig. 5 is performed by paper 
document processing device 102. Accordingly, flow- 
chart 500 depicted in Fig. 5 is merely illustrative of an 
embodiment incorporating the present invention and 25 
does not limit the scope of the invention as recited in the 
claims. One of ordinary skill in the art would recognize 
variations, modifications, and alternatives. The process- 
ing depicted in Fig. 5 may be performed by software 
modules executed by a processor of data processing 30 
system 104, by hardware modules of data processing 
system 104, or a combination thereof. 
[0088] As depicted in Fig. 5, processing is initiated 
when data processing system 104 receives a signal 
from a paper document processing device 102 (step 35 
502). The signal may comprise information identifying 
an identification code corresponding to an identification 
tag and electronic representation of a paper document. 
Other types of information such as the physical location 
of a paper document may also be included in alternative *o 
embodiments of the present invention. 
[0089] Data processing system 104 then extracts a 
feature descriptor from the electronic representation re- 
ceived in step 502 (step 504). A feature descriptor ex- 
tracted from an electronic representation of a paper doc- 
ument represents information extracted from the elec- 
tronic representation (e.g., a scanned image). The fea- 
ture descriptor can be used to uniquely identify the doc- 
ument. The extracted feature descriptor can be matched 
to other feature descriptors if the feature descriptors are so 
consistently extracted from a similarly located section of 
the electronic representation. For example, a small rec- 
tangular patch from the upper left hand comer of the first 
page of a document may be selected for extracting a 
feature descriptor. One technique used by data process- 55 
ing system 104 for extracting a feature descriptor from 
an electronic representation is described below. Various 
other feature extraction techniques known to those 



skilled in the art may also be used by data processing 
system 1 04. For example, the various feature extraction 
techniques described in U.S. Patent No. 6,104,834 may 
also be used by the present invention. The entire con- 
tents of U.S. Patent No. 6,104,834 are incorporated 
herein by reference for all purposes. 
[0090] Data processing system 104 then searches 
the one or more document information databases 110 
to identify one or more feature descriptors that match 
the feature descriptor extracted in step 504 (step 506). 
If the feature descriptor extracted in step 504 and the 
feature descriptors stored in document information da- 
tabases 1 1 0 are extracted from a consistent location of 
an electronic representation (e.g., a scanned image) of 
a paper document, then if two feature descriptors match, 
it implies that there is a high likelihood that the scanned 
images from which the feature descriptors were extract- 
ed also match. Alternatively, if two feature descriptors 
do not match, then there is a high likelihood that the 
scanned images used for extracting the feature descrip- 
tors are different, and as a result the documents are dif- 
ferent. Accordingly, in step 506 data processing system 
104 identifies one or more feature descriptors in docu- 
ment information databases 110 that match the feature 
descriptor extracted in step 504. 
[0091] As described above, according to an embodi- 
ment of the present invention, each document informa- 
tion database 110 stores feature descriptors that have 
been previously determined for a plurality of documents. 
For each feature descriptor, electronic document infor- 
mation corresponding to the feature descriptor is also 
stored. The electronic document information may com- 
prise information identifying an electronic document, in- 
formation specifying a storage location for the electronic 
document represented by the feature descriptor or a 
pointer or reference to the electronic document. For ex- 
ample, electronic document information for a particular 
feature descriptor may identify a file corresponding to 
the particular feature descriptor and a directory location 
where the file is stored, a uniform resource identifier or 
locator (URI or URL) pointing to an electronic document 
represented by the particular feature descriptor, and 
other like information. The information in a document in- 
formation database 110 may be indexed by the feature 
descriptors information. Accordingly, for the feature de- 
scriptor extracted in step 504, data processing system 
1 04 may use the feature descriptor to search document 
information databases 1 1 0 to find one or more matching 
feature descriptors. Since a set of matching feature de- 
scriptors implies that there is a high likelihood that the 
scanned images corresponding to the feature descrip- 
tors match, it further implies that the electronic docu- 
ment from which a matching feature descriptor was ex- 
tracted corresponds to the paper document from whose 
electronic representation the other feature descriptor 
was extracted. 

[0092] Data processing system 1 04 then determines, 
from document information databases 110, electronic 
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document information corresponding to each matching 
feature descriptor determined in step 506 (step 508). 
Data processing system 104 may then store the elec- 
tronic document information determined in step 508 
along with the identification code received in step 502 
in paper document tracking database 108 (step 510). 
The electronic document information is associated with 
the identification code and stored in a manner such that 
the electronic document information for a particular 
identification code can be determined given the identi- 
fication code. 

[0093] Other information received by data processing 
system 104 from paper document processing device 
1 02 in step 502 for the particular identification code may 
also be stored such that the other information can be 
retrieved given the identification code information. For 
example, information identifying physical location of a 
paper document corresponding to the identification 
code may also be stored in paper document tracking da- 
tabase 108. -The information stored in paper document 
tracking database 108 may be indexed by the identifi- 
cation code information. As depicted in Fig. 1A, paper 
document tracking database 108 may be implemented 
as a table wherein each row of the table stores informa- 
tion related to a particular identification code. As shown 
in Fig. 1A, each row stores information identifying an 
identification code, electronic document information de- 
termined by data processing system 104 for a paper 
document corresponding to the identification code, 
physical location information for the paper document 
identified by the identification code, and other informa- 
tion related to the paper document corresponding to the 
identification code. 

[0094] The electronic document information deter- 
mined in step 508, or a portion thereof, may then be 
communicated to the paper document processing de- 
vice 102 that was the source of the signal received in 
step 502 (step 512). As described above, paper docu- 
ment processing device 102 may then store the infor- 
mation in an identification tag. 

[0095] Fig. 6 is a simplified high-level flowchart 600 
depicting a method of extracting a feature descriptor 
from an electronic representation of a paper document 
according to an embodiment of the present invention. 
The processing depicted in Fig. 6 may be performed by 
data processing system 104 (for embodiment depicted 
in Fig. 1 A) or by paper document processing device 1 02 
(forthe embodiment depicted in Fig. 1B). Flowchart 600 
depicted in Fig. 6 is merely illustrative of an embodiment 
incorporating the present invention and does not limit 
the scope of the invention as recited in the claims. One 
of ordinary skill in the art would recognize variations, 
modifications, and alternatives. The processing depict- 
ed in Fig. 6 may be performed by software modules ex- 
ecuted by a processor, by hardware modules, or a com- 
bination thereof. According to an embodiment of the 
present invention, the processing depicted in Fig. 6 is 
performed in step 504 of flowchart 500 depicted in Fig. 5. 



[0096] As depicted in Fig. 6, a portion of the electronic 
representation is selected (step 602). The selected por- 
tion may represent, for example, a section of the elec- 
tronic representation corresponding to a particular page 

5 of a document. 

[0097] Zones of text, pictures, graphics, photographs, 
etc. are then located on the portion selected in step 602 
(step 604). The coordinates (e.g., X-coordi nates and Y- 
coordinates) and extent of each zone may be calculated 

10 in step 604. 

[0098] A feature extraction technique is then applied 
to at least one zone (or a subset of a zone) determined 
in step 604 to determine a feature descriptor for the elec- 
tronic representation (step 606). To enable matching of 

15 the feature descriptor extracted for the electronic repre- 
sentation and feature descriptors stored in document in- 
formation databases 1 1 0, the location of the one or more 
zones used for feature extraction are the same as those 
selected for extracting feature descriptors stored in doc- 

20 ument information databases 110. 

[0099] Several different feature extraction techniques 
known to those skilled in the art may be applied. Accord- 
ing to one technique, optical character recognition 
(OCR) may be applied to one or more zones and the 

25 results of the OCR analysis represent the feature de- 
scriptor for the electronic representation of the paper 
document. Various other techniques known to those 
skilled in the art may be used in alternative embodi- 
ments of the present invention. 

30 [0100] The feature descriptor extracted in step 606 
may then be used to search one or more document in- 
formation databases 110 as indicated in step 506 of 
flowchart 500 in Fig. 5. According to an embodiment of 
the present invention, a search query may be construct- 

35 ed using the extracted feature descriptor. The search 
query may then be used to search the document infor- 
mation databases 110 to locate matching feature de- 
scriptors per step 506 of flowchart 500 depicted in Fig. 5. 
[01 01] Fig. 7 depicts an example of applying the meth- 

40 od depicted in Fig. 6 according to an embodiment of the 
present invention. As shown in Fig. 7, a portion 704 of 
electronic representation (e.g., scanned image) 702 
representing a first page of a document is selected. Var- 
ious zones 706 are then identified on the selected por- 

45 tion. A subset 708 of a text zone is then selected for 
extracting the feature descriptor. OCR feature extraction 
techniques are then applied to subset zone 708 and the 
result of the OCR analysis represents the feature de- 
scriptor 71 0 for electronic representation 702. A search 

so query 71 2 is then configured based upon the feature de- 
scriptor. Search query 712 may then be used to search 
document information databases 110. 
[0102] As described above, according to an embodi- 
ment of the present invention, electronic document in- 

55 formation for a paper document is located and stored 
along with the identification code read from an identifi- 
cation tag that is physically associated with the paper 
document. For example, the information may be stored 
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in a paper document tracking database 108. According 
to an embodiment of the present invention, the electron- 
ic document information may also be stored in the iden- 
tification tag that is physically associated with the paper 
document. The information stored in paper document 
tracking database 108 and/or the information stored in 
the identification tag may be used in various applications 
for various purposes. 

[0103] According to one application, the stored infor- 
mation for a paper document may be used to generate 
a "perfect" copy of the paper document. A well known 
inherent problem with photocopying a document is that 
the quality of the photocopy degrades as successive 
copies are made. For example, if the original paper doc- 
ument is referred to as the "Oth generation," all copies 
are first generation, at best. Degradation of copy quality 
is inevitable as copies are made from successive (i.e., 
first, second, third, etc.) generation copies wherein the 
cumulative degradation can ultimately result in an un- 
recognizable document. The present invention resolves 
this problem and can be used to generate photocopies 
that are as good as the original paper document, i.e. a 
"perfect" photocopy. 

[0104] Various techniques may be used to generate 
a "perfect" photocopy of a paper document that has an 
identification tag physically associated with it. As de- 
scribed above, according to an embodiment of the 
present invention, the identification tag stores electronic 
document information for the paper document which 
specifies an electronic document corresponding to the 
paper document and the storage location of the elec- 
tronic document. When a photocopy of the paper docu- 
ment is to be generated, the copier may be configured 
to read the electronic document information for the pa- 
per document from the identification tag that is physical- 
ly associated with the paper document, access the elec- 
tronic document corresponding to the paper document, 
and use the electronic document to generate the pho- 
tocopy. In this manner, the copier is able to generate a 
"Oth generation" photocopy on a continuous basis. 
[0105] If the electronic document information is not 
stored in the identification tag that is physically associ- 
ated with the paper document, the copier may be con- 
figured to read the identification code from the identifi- 
cation tag, use the identification code to locate a record 
(e.g., a row in a table) in paper document tracking da- 
tabase 108, find electronic document information for the 
paper document corresponding to the identification 
code from the paper document tracking database, and 
then use the electronic document information to gener- 
ate the photocopy. In this manner, the copier is able to 
generate a "Oth generation" photocopy for the paper 
document. 

[0106] Various other operations, besides photocopy- 
ing, may also be performed on an electronic document 
corresponding to a paper document using the paper 
document. For example, a facsimile machine may be 
configured to read information stored in an identification 



tag that is physically associated with a paper document, 
use the information to locate an electronic document 
corresponding to the paper document, and use the elec- 
tronic document to communicate information via facsim- 
5 ile. A printer may be configured to read information 
stored in an identification tag that is physically associ- 
ated with a paper document, use the information to lo- 
cate an electronic document corresponding to the paper 
document, and generate a printout based upon the elec- 
ta tronic document. Various other operations that can be 
performed on electronic documents are also envisioned 
within the scope of the present invention. 
[0107] According to an embodiment of the present in- 
vention, the physical location of a paper document that 
15 has an identification tag physically associated with it can 
be tracked. 

[0108] Although specific embodiments of the inven- 
tion have been described, various modifications, alter- 
ations, alternative constructions, and equivalents are al- 
so encompassed within the scope of the invention. The 
described invention is not restricted to operation within 
certain specific data processing environments, but is 
free to operate within a plurality of data processing en- 
vironments. Additionally, although the present invention 
has been described using a particular series of transac- 
tions and steps, it should be apparent to those skilled in 
the art that the scope of the present invention is not lim- 
ited to the described series of transactions and steps. 
[0109] Further, while the present invention has been 
described using a particular combination of hardware 
and software, it should be recognized that other combi- 
nations of hardware and software are also within the 
scope of the present invention. The present invention 
may be implemented only in hardware, or only in soft- 
ware, or using combinations thereof. 
[01 1 0] The specification and drawings are, according- 
ly, to be regarded in an illustrative rather than a restric- 
tive sense. It will, however, be evident that additions, 
subtractions, deletions, and other modifications and 
changes may be made thereunto without departing from 
the broader spirit and scope of the invention as set forth 
in the claims. 



Claims 

1. A method of storing information for a paper docu- 
ment, the method comprising: 

obtaining an electronic representation of a first 
paper document; 

determining a first file descriptor based upon 
the electronic representation of the first paper 
document; 

determining a feature descriptor from a first da- 
tabase that matches the first feature descriptor, 
the first database storing information identify- 
ing a plurality of feature descriptors and elec- 
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tronic document information for each feature 
descriptor in the plurality of feature descriptors, 
wherein the electronic document information 
for a feature descriptor comprises information 
identifying an electronic document; 
determining, from the first database, electronic 
document information associated with the 
matching feature descriptor; and 
storing an identification code associated with 
an identification tag and the electronic docu- 
ment information associated with the matching 
feature descriptor in a second database such 
that the electronic document information asso- 
ciated with the matching feature descriptor is 
associated with the identification code. 



storing the electronic document information as- 
sociated with the matching feature descriptor 
on the identification tag; and 
physically associating the identification tag with 
5 the first paper document. 

10. The method of claim 1 wherein obtaining the elec- 
tronic representation of the first paper document 
comprises obtaining a scanned image of the first 

w paper document. 

1 1 . The method of claim 1 wherein determining the first 
file descriptor comprises: 

'5 selecting a portion of the electronic represen- 

tation of the first paper document; and 
extracting the first feature descriptor from the 
selected portion of the electronic representa- 
tion of the first paper document. 

12. The method of claim 11 wherein extracting the first 
feature descriptor from the selected portion of the 
first paper document comprises: 

identifying one or more zones on the selected 
portion of the electronic representation of the 
first paper document; and 
extracting the first feature descriptor from at 
least one zone from the one or more zones. 

13. The method of claim 12 wherein extracting the first 
feature descriptor from the at least one zone from 
the one or more zones comprises applying optical 
character recognition techniques to the at least one 
zone to extract the first feature descriptor. 

14. A method of performing an operation using a paper 
document, the method comprising: 

reading a first identification code from an iden- 
tification tag that is physically associated with 
a first paper document; 

determining an identification code from a first 
database that matches the first identification 
code, the first database storing information 
identifying a plurality of identification codes and 
electronic document information for each iden- 
tification code in the plurality of identification 
codes, the electronic document information for 
each identification code comprising information 
identifying an electronic document; 
determining, from the first database, electronic 
document information associated with the 
matching identification code; and 
performing the operation using the electronic 
document identified by the electronic document 
information associated with the matching iden- 
tification code. 



2. The method of claim 1 wherein the electronic doc- 
ument information for a feature descriptor in the plu- 
rality of feature descriptors comprises information 
identifying a storage location of the electronic doc- 20 
ument identified by the electronic document infor- 
mation for the feature descriptor. 

3. The method of claim 1 wherein the electronic doc- 
ument information for a feature descriptor in the plu- 25 
rality of feature descriptors comprises a uniform re- 
source identifier (LRI). 

4. The method of claim 1 further comprising generat- 
ing a second paper document based upon the elec- 30 
tronic representation of the first paper document. 

5. The method of claim 4 wherein generating the sec- 
ond paper document comprises physically associ- 
ating the identification tag with the second paper 35 
document. 

6. The method of claim 1 further comprising physically 
associating the identification tag with the first paper 
document. 40 

7. The method of claim 1 wherein the identification tag 
is physically associated with the first paper docu- 
ment. 

45 

8. The method of claim 1 further comprising: 

storing the electronic document information as- 
sociated with the matching feature descriptor 
on the identification tag; and so 
generating a second paper document based 
upon the electronic representation of the first 
paper document, wherein the identification tag 
is physically associated with the second paper 
document. 55 

9. The method of claim 1 further comprising: 
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1 5. The method of claim 1 4 wherein performing the op- 
eration using comprises generating a second paper 
document using the electronic document identified 
by the electronic document information associated 
with the matching identification code. 

16. A system for processing a paper document, the sys- 
tem comprising: 

a processor; and 

a memory coupled to the processor, the mem- 
ory configured to store a plurality of code mod- 
ules for execution by the processor, the plurality 
of code modules comprising: 

a code module for obtaining an electronic 
representation of a first paper document; 
a code module for determining a first file 
descriptor based upon the electronic rep- 
resentation of the first paper document; 
a code module for determining a feature 
descriptor from a first database that match- 
es the first feature descriptor, the first da- 
tabase storing information identifying a plu- 
rality of feature descriptors and electronic 
document information for each feature de- 
scriptor in the plurality of feature descrip- 
tors, wherein the electronic document in- 
formation for a feature descriptor compris- 
es information identifying an electronic 
document; 

a code module for determining, from the 
first database, electronic document infor- 
mation associated with the matching fea- 
ture descriptor; and 

a code module for storing an identification 
code associated with an identification tag 
and the electronic document information 
associated with the matching feature de- 
scriptor in a second database such that the 
electronic document information associat- 
ed with the matching feature descriptor is 
associated with the identification code. 

17. The system of claim 16 wherein the electronic doc- 
ument information for a feature descriptor in the plu- 
rality of feature descriptors comprises information 
identifying a storage location of the electronic doc- 
ument identified by the electronic document infor- 
mation for the feature descriptor. 

18. The system of claim 16 wherein the electronic doc- 
ument information for a feature descriptor in the plu- 
rality of feature descriptors comprises a uniform re- 
source identifier (URI). 

19. The system of claim 16 further comprising a sub- 
system configured to generate a second paper doc- 



ument based upon the electronic representation of 
the first paper document. 

20. The system of claim 19 wherein the subsystem is 
5 configured to physically associate the identification 

tag with the second paper document. 

21. The system of claim 16 further comprising a sub- 
system configured to physically associate the iden- 

10 tification tag with the first paper document. 

22. The system of claim 1 6 wherein the identification 
tag is physically associated with the first paper doc- 
ument. 

15 

23. The system of claim 1 6: 

wherein the plurality of code modules further 
comprises a code module for storing the elec- 
ta tronic document information associated with 
the matching feature descriptor on the identifi- 
cation tag; and 

the system further comprises a subsystem con- 
figured to generate a second paper document 
25 based upon the electronic representation of the 

first paper document, wherein the identification 
tag is physically associated with the second pa- 
per document. 



wherein the plurality of code modules further 
comprises a code module for storing the elec- 
tronic document information associated with 
the matching feature descriptor on the identifi- 
cation tag; and 

the system further comprises a subsystem for 
physically associating the identification tag with 
the first paper document. 

25. The system of claim 16 wherein the code module 
for obtaining the electronic representation of the 
first paper document comprises a code module for 
obtaining a scanned image of the first paper docu- 
ment. 

26. The system of claim 16 wherein the code module 
for determining the first file descriptor comprises: 

a code module for selecting a portion of the 
electronic representation of the first paper doc- 
ument; and 

a code module for extracting the first feature 
descriptor from the selected portion of the elec- 
tronic representation of the first paper docu- 
ment. 

27. The system of claim 26 wherein the code module 



25 



30 24. The system of claim 16 further comprising: 



35 



40 



45 



50 



16 



31 

for extracting the first feature descriptor from the se- 
lected portion of the first paper document compris- 
es: 

a code module for identifying one or more 5 
zones on the selected portion of the electronic 
representation of the first paper document; and 
a code module for extracting the first feature 
descriptor from at least one zone from the one 
or more zones. 10 

28. The system of claim 27 wherein the code module 
for extracting the first feature descriptor from the at 
least one zone from the one or more zones com- 
prises a code module for applying optical character 15 
recognition techniques to the at least one zone to 
extract the first feature descriptor. 

29. A system for performing an operation using a paper 
document, the system comprising: 20 

a processor; and 

a memory coupled to the processor, the mem- 
ory configured to store a plurality of code mod- 
ules for execution by the processor, the plurality 25 
of code modules comprising: 

a code module for reading a first identifica- 
tion code from an identification tag that is 
physically associated with a first paper 30 
document; 

a code module for determining an identifi- 
cation code from a first database that 
matches the first identification code, the 
first database storing information identify- 35 
ing a plurality of identification codes and 
electronic document information for each 
identification code in the plurality of identi- 
fication codes, the electronic document in- 
formation for each identification code com- 40 
prising information identifying an electronic 
document; 

a code module for determining, from the 
first database, electronic document infor- 
mation associated with the matching iden- 45 
tification code; and 

a code module for causing the operation to 
be performed using the electronic docu- 
ment identified by the electronic document 
information associated with the matching so 
identification code. 

30. The system of claim 29 wherein the code module 
for causing the operation to be performed compris- 
es a code module for causing a first subsystem to ss 
generate a second paper document using the elec- 
tronic document identified by the electronic docu- 
ment information associated with the matching 



32 

identification code. 

31 . A computer program product stored on a computer- 
readable storage medium for processing a paper 
document, the computer program product compris- 
ing: 

code for obtaining an electronic representation 
of a first paper document; 
code for determining a first file descriptor based 
upon the electronic representation of the first 
paper document; 

code for determining a feature descriptor from 
a first database that matches the first feature 
descriptor, the first database storing informa- 
tion identifying a plurality of feature descriptors 
and electronic document information for each 
feature descriptor in the plurality of feature de- 
scriptors, wherein the electronic document in- 
formation for a feature descriptor comprises in- 
formation identifying an electronic document; 
code for determining, from the first database, 
electronic document information associated 
with the matching feature descriptor; and 
code for storing an identification code associ- 
ated with an identification tag and the electronic 
document information associated with the 
matching feature descriptor in a second data- 
base such that the electronic document infor- 
mation associated with the matching feature 
descriptor is associated with the identification 
code. 

32. The computer program product of claim 31 wherein 
the electronic document information for a feature 
descriptor in the plurality of feature descriptors com- 
prises information identifying a storage location of 
the electronic document identified by the electronic 
document information for the feature descriptor. 

33. The computer program product of claim 31 wherein 
the electronic document information for a feature 
descriptor in the plurality of feature descriptors com- 
prises a uniform resource identifier (URI). 

34. The computer program product of claim 31 further 
comprising code for causing a second paper docu- 
ment to be generated based upon the electronic 
representation of the first paper document. 

35. The computer program product of claim 34 wherein 
the code for causing the second paper document to 
bee generated comprises code for causing the 
identification tag to be physically associated with 
the second paper document. 

36. The computer program product of claim 31 further 
comprising code for causing the identification tag to 
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be physically associated with the first paper docu- 
ment. 

37. The computer program product of claim 31 wherein 
the code for storing the identification code compris- 5 
es code for reading the identification code from the 
identification tag physically associated with the first 
paper document. 

38. The computer program product of claim 31 further 10 
comprising: 

code for storing the electronic document infor- 
mation associated with the matching feature 
descriptor on the identification tag; and 15 
code for causing generation of a second paper 
document based upon the electronic represen- 
tation of the first paper document, wherein the 
identification tag is physically associated with 
the second paper document. 20 

39. The computer program product of claim 31 further 
comprising: 

code for storing the electronic document infor- 25 
mation associated with the matching feature 
descriptor on the identification tag; and 
code for causing the identification tag to be 
physically associated with the first paper docu- 
ment. 30 

40. The computer program product of claim 31 wherein 
the code for obtaining the electronic representation 
of the first paper document comprises code for ob- 
taining a scanned image of the first paper docu- 35 
ment. 

41 . The computer program product of claim 31 wherein 
the code for determining the first file descriptor com- 
prises: 40 

code for selecting a portion of the electronic 
representation of the first paper document; and 
code for extracting the first feature descriptor 
from the selected portion of the electronic rep- 45 
resentation of the first paper document. 

42. The computer program product of claim 41 wherein 
the code for extracting the first feature descriptor 
from the selected portion of the first paper docu- so 
ment comprises: 

code for identifying one or more zones on the 
selected portion of the electronic representa- 
tion of the first paper document; and 55 
code for extracting the first feature descriptor 
from at least one zone from the one or more 
zones. 



43. The computer program product of claim 42 wherein 
the code for extracting the first feature descriptor 
from the at least one zone from the one or more 
zones comprises code for. applying optical charac- 
ter recognition techniques to the at least one zone 
to extract the first feature descriptor. 

44. A computer program product stored on a computer- 
readable storage medium for causing an operation 
to be performed using a paper document, the com- 
puter program product comprising: 

code for reading a first identification code from 
an identification tag that is physically associat- 
ed with a first paper document; 
code for determining an identification code from 
a first database that matches the first identifi- 
cation code, the first database storing informa- 
tion identifying a plurality of identification codes 
and electronic document information for each 
identification code in the plurality of identifica- 
tion codes, the electronic document information 
for each identification code comprising informa- 
tion identifying an electronic document; 
code for determining, from the first database, 
electronic document information associated 
with the matching identification code; and 
code for causing the operation to be performed 
using the electronic document identified by the 
electronic document information associated 
with the matching identification code. 

45. The computer program product of claim 44 wherein 
the code for causing the operation to be performed 
using the electronic document comprises code for 
causing a second paper document to be generated 
using the electronic document identified by the elec- 
tronic document information associated with the 
matching identification code. 

46. A system for processing a paper document, the sys- 
tem comprising: 

means for obtaining an electronic representa- 
tion of a first paper document; 
means for determining a first file descriptor 
based upon the electronic representation of the 
first paper document; 

means for determining a feature descriptor 
from a first database that matches the first fea- 
ture descriptor, the first database storing infor- 
mation identifying a plurality of feature descrip- 
tors and electronic document information for 
each feature descriptor in the plurality of fea- 
ture descriptors, wherein the electronic docu- 
ment information for a feature descriptor com- 
prises information identifying an electronic doc- 
ument; 
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means for determining, from the first database, 
electronic document information associated 
with the matching feature descriptor; and 
means for storing an identification code asso- 
ciated with an identification tag and the elec- 5 
tronic document information associated with 
the matching feature descriptor in a second da- 
tabase such that the electronic document infor- 
mation associated with the matching feature 
descriptor is associated with the identification 10 
code. 

47. A system for performing an operation using a paper 
document, the system comprising: 

15 

means for reading a first identification code 
from an identification tag that is physically as- 
sociated with a first paper document; 
means for determining an identification code 
from a first database that matches the first iden- 20 
tification code, the first database storing infor- 
mation identifying a plurality of identification 
codes and electronic document information for 
each identification code in the plurality of iden- 
tification codes, the electronic document infor- 25 
mation for each identification code comprising 
information identifying an electronic document; 
means for determining, from the first database, 
electronic document information associated 
with the matching identification code; and 30 
means for performing the operation using the 
electronic document identified by the electronic 
document information associated with the 
matching identification code. 
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